Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus

Abstract

Automatic speech recognition (ASR) and text-to-speech (TTS) are two prominent areas of research in human-computer interaction (HCI). A set of phonetically rich sentences is important for developing these two interactive modules of HCI. Essentially, the set of phonetically rich sentences has to cover all possible phone units, distributed uniformly. Selecting such a set from a large corpus while maintaining similarity based on phonetic characteristics is still a challenging problem. The major objective of this paper is to devise a criterion for selecting a set of sentences that encompasses all phonetic aspects of a corpus while keeping the set as small as possible. First, this paper presents a statistical analysis of Hindi phonetics by observing its structural characteristics. Further, a two-stage algorithm is proposed to extract phonetically rich sentences with a high variety of triphones from the EMILLE Hindi corpus. The algorithm includes a distance-measuring criterion for selecting a sentence so as to improve the triphone distribution. Moreover, a special preprocessing method is proposed that scores each triphone in terms of inverse probability in order to speed up the algorithm. The results show that the approach efficiently builds a uniformly distributed, phonetically rich corpus with an optimum number of sentences.
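The abstract does not spell out the two-stage algorithm or its distance measure, so the following is only a minimal sketch of the general idea: score each triphone by the inverse of its relative frequency, then greedily pick the sentence that adds the most not-yet-covered score. The function names (`triphones`, `inverse_probability_scores`, `select_sentences`), the single-stage greedy loop, and the `budget` parameter are assumptions made for illustration and are not taken from the paper. Sentences are assumed to be already converted to phone sequences.

```python
from collections import Counter


def triphones(phones):
    """Return the overlapping triphones (3-grams) of a phone sequence."""
    return [tuple(phones[i:i + 3]) for i in range(len(phones) - 2)]


def inverse_probability_scores(corpus):
    """Score each triphone by the inverse of its relative frequency.

    Rare triphones get high scores, so sentences containing them are
    favoured during selection. (Hypothetical scoring, for illustration.)
    """
    counts = Counter(t for phones in corpus for t in triphones(phones))
    total = sum(counts.values())
    return {t: total / c for t, c in counts.items()}


def select_sentences(corpus, budget):
    """Greedily pick up to `budget` sentence indices.

    At each step, choose the sentence whose not-yet-covered triphones
    have the largest summed score; stop early once nothing new is gained.
    """
    scores = inverse_probability_scores(corpus)
    covered, chosen = set(), []
    remaining = set(range(len(corpus)))

    def gain(i):
        new = set(triphones(corpus[i])) - covered
        return sum(scores[t] for t in new)

    while remaining and len(chosen) < budget:
        # sorted() makes tie-breaking deterministic (lowest index wins).
        best = max(sorted(remaining), key=gain)
        if gain(best) == 0:
            break
        chosen.append(best)
        covered |= set(triphones(corpus[best]))
        remaining.remove(best)
    return chosen
```

For a toy corpus of placeholder phone sequences such as `[["a", "b", "c", "d"], ["a", "b", "c"], ["x", "y", "z"]]`, `select_sentences(corpus, 3)` returns `[0, 2]`: the second sentence is skipped because its only triphone is already covered by the first. A real pipeline would also need a Hindi grapheme-to-phoneme step, which is outside the scope of this sketch.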
